Emotional speech synthesis for emotionally-rich virtual worlds

Author

  • Marc Schröder
Abstract

This paper aims to give a brief overview of the current state of the art in emotional speech synthesis in view of a multi-modal context. After a brief introduction to the concept of text-to-speech synthesis, two approaches to the expression of emotions in speech synthesis are described. The categorical approach models emotions as discrete categories and is able to provide high-quality emotional speech for a few emotion categories; the dimensional approach uses emotion dimensions such as activation and evaluation to model essential emotional properties, leading to more flexible but less specific expressions. Architectural requirements for an audio-visual integration are outlined. Three examples of demonstrators illustrate the types of applications we currently envisage. Finally, the question of validation of a generation system is formulated, and a direction for the development of possible answers is suggested.

Related resources

Trackside DEIRA: a dynamic engaging intelligent reporter agent

DEIRA is a virtual agent commenting on virtual horse races in real time. DEIRA analyses the state of the race, acts emotionally and comments about the situation in a believable and engaging way, using synthesized speech and facial expressions. In this paper we discuss the challenges, explain the computational models for the cognitive, emotional and communicative behavior, and account on impleme...

Automatic Recognition of Emotionally Coloured Speech

Emotion in speech is an issue that has been attracting the interest of the speech community for many years, both in the context of speech synthesis as well as in automatic speech recognition (ASR). In spite of the remarkable recent progress in Large Vocabulary Recognition (LVR), it is still far behind the ultimate goal of recognising free conversational speech uttered by any speaker in any envi...

Perception of emotional congruency in multimodal speech synthesis

This working paper experimentally investigates the perception of emotional congruency in multimodal speech synthesis. To this end, two perceptual experiments are described. Experiment 1 is a preliminary test exploring to what extent subjects are able to identify emotions in synthetic speech as well as in faces presented in short video clips. Results show that subjects find it easier to recognize emotions...

Verification of Acoustical Correlates of Emotional Speech using Formant-Synthesis

This paper explores the perceptual relevance of acoustical correlates of emotional speech by means of speech synthesis. Besides, the research aims at the development of »emotionrules« which enable an optimized speech synthesis system to generate emotional speech. Two investigations using this synthesizer are described: 1) the systematic variation of selected acoustical features to gain a prelim...

Publication date: 2003